(54) Tumour suppressor gene



(57) A detailed genetic map of human chromosome 11 was prepared. Then, a commonly deleted region on
the chromosome in the tumor tissues of patients with multiple endocrine neoplasia type 1 was identified.
Further, by linkage analysis of a family line with this disease, the gene causative of this disease was
localized. A gene present in the region common to these observations was cloned and its structure was
determined. Because the protein coded for by this DNA is homologous with transcriptional factors, it is
expected that the above-mentioned gene may be a novel tumor suppressor gene. It is also expected that
this gene and the protein coded for thereby may be useful in the preparation of a remedy for cancer and
a diagnostic drug for cancer.
Background of the Invention
Field of the Invention

Description of the Related Art



Disclosure of the Invention 
Summary of the Invention 



rZTS' J^f CCdi " 9 30 amino add beins homologous with transcriptional factors suchZ 

human Wilms tumor suppressor gene (WT1) product and human early growth Response DroMn* zfBMa?^!^ 

e*^ "ST ^ ^ reSPOndS 10 Vari0US and ^•no»S^^SSSSS?S 

s^E "r'T 5 '™ 6 ' drCU,a,0r > ^ o"**™ systems^ being input iSKS^^S.^S 

a lrv.no organs as it is and cancer breaks out when this coordinXn falls inSrter ^ 
cDNA in the full length is an intranuclear transcriptional regulator having a nuclear localizing signal, a proline-rich
domain and a zinc finger motif. 

Thus, the present invention relates to: 

(1) a DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1 ; 

(2) a polypeptide comprising the full structure or a part of the polypeptide coded for by the DNA represented by 
SEQ ID NO:1;

(3) a transformant transformed by a plasmid having, integrated therein, the full structure or a part of the DNA rep- 
resented by SEQ ID NO:1 which can be expressed therein; 

(4) an antibody against the above-mentioned polypeptide as an antigen; and 

(5) a gene analysis method which comprises using, as a primer, a probe or a marker, a DNA comprising a part of 
the DNA represented by SEQ ID NO:1 and hybridizing the primer, the probe or the marker with a DNA to be tested. 

In other words, the present invention relates to: 

(a) a cDNA which comprises one containing the full or a part of the cDNA of the tumor suppressor gene repre- 
sented by SEQ ID NO:1; 

(b) a polypeptide which comprises one containing the full or a part of the polypeptide coded for by the cDNA of the 
tumor suppressor gene represented by SEQ ID NO:1; 

(c) host cells which are obtained by integrating the full or a part of the cDNA described in SEQ ID NO:1 into a
plasmid which can express it and transforming thereby;

(d) an antibody against the polypeptide described in the above item (b) as an antigen; and

(e) a gene analysis method characterized by using a DNA containing a part of the DNA sequence described in the 
above item (a) as a primer, a probe or a marker. 

With respect to the DNAs and polypeptides, those which are substantially equivalent to the DNAs and polypeptides
described above are also included in the scope of the present invention. The expression "DNAs and polypeptides being
substantially equivalent" means those which have been modified via, for example, deletion, replacement, addition or
insertion of the constituting bases or constituting amino acids, and derivatives thereof, which exhibit the same effects
as those of the original DNAs or polypeptides. However, the extent of these effects is irrelevant thereto. The term
"a part of the DNA" means a fragment composed of at least 10 bases derived from the DNA. For use as a primer, for
example, a DNA fragment having a base sequence generally consisting of 10 to 30 bases, preferably 15 to 25 bases, is
selected. For use as a probe, a DNA fragment having a base sequence generally consisting of at least 15 bases,
preferably at least 20 bases, is selected.
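The length ranges stated above for primers and probes can be expressed as a simple check. The following Python
fragment is only an illustrative sketch; the function name `classify_fragment` and the keys of the returned
dictionary are hypothetical and are introduced here solely for illustration.

```python
def classify_fragment(length: int) -> dict:
    """Compare a DNA fragment length (in bases) with the ranges given in the text.

    Hypothetical helper: it encodes only the numeric limits stated above
    and says nothing about sequence specificity or melting temperature.
    """
    return {
        "part_of_dna": length >= 10,            # "a part of the DNA": at least 10 bases
        "primer_general": 10 <= length <= 30,   # primer: generally 10 to 30 bases
        "primer_preferred": 15 <= length <= 25, # primer: preferably 15 to 25 bases
        "probe_general": length >= 15,          # probe: generally at least 15 bases
        "probe_preferred": length >= 20,        # probe: preferably at least 20 bases
    }


if __name__ == "__main__":
    for n in (8, 12, 20, 40):
        print(n, classify_fragment(n))
```

A 20-base oligonucleotide, for instance, falls within the preferred range for both uses, whereas a 12-base fragment
falls only within the general primer range.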

The term "a part of the polypeptide" means a peptide having a sequence composed of at least 6 amino acid residues
derived from the polypeptide. When a part of a polypeptide is to be used as an antigen for the preparation of an
antibody or as an epitope for the detection of an antibody, it is known that a peptide having a sequence consisting of 6
amino acid residues would bind to an antibody [see WO 8403564, published on Sep. 13, 1984 (Assignee: COMMONWEALTH
SERUM LABS and GEYSEN, H. M.)]. A peptide having a sequence generally consisting of at least 10 amino
acid residues, preferably at least 20 amino acid residues, is employed therefor. Although it may be anticipated that a
peptide having a sequence consisting of 6 amino acid residues can achieve only a poor efficiency in the production of
an antibody, such a peptide is also usable in the form of a fused peptide.

Furthermore, an RNA which comprises one translated from the DNA represented by SEQ ID NO:1 or a part of the 
same and RNAs which are substantially equivalent thereto are included in the scope of the present invention. 

Now the present invention will be described in greater detail. 



Detailed Description of the Invention 
(1) Isolation of cDNA 



s The target cosmid library of the human chromosome 1 1 can be Dreoared in for exarmte tho mi^™ ™, 

From humanlrrcuse hybrid cel. line confining a single human S^SeSZTa 
c*™ma.DNAis extracted. Then DNA fragments of me chrorr^rra) Dr^ can 

» .Sini ?X a ° neS , ^ 30 inSert in human chromosome can be^creened by SSS 

2r^n ^ 3 Wh0 ' e ^ DNA 35 Prrt)e - *»* <*> nes containing a DNA origT 

SL ^TTu r T S Tir *" SUb, ' eCted to tte flUOrescent irvsitu Nation (FISH) method [££ 
Srl^i 14 " 16 (1990)] - ^ each <* 1,16 '^eof thecosmiddonescan be tocalSf 

C ^TT e ^ f PhySiCal ™Pcanbe prepared. Further. RFLP markers canto S 

, 5 1 e^L^ ^ tes prepared b * deav, '"9 hu ™ DNA wrth several «S 

' 5 l 00 enzym ^ INakamura et al.. Am J. Hum. Genet. 43. 854 - 859 (1988)]. Among these clones, those located around 

mTTriqiaaresubie^ 

map. Based on ths map, the DNA of a cancer tissue of a patient is examined in the loss of heterozygosity Sflbv 
From the cosmri clones ewshng in the region which has been thus restricted to, a DNA fragment beina under 
fiPQivf ^tf^nliA exon trapping method [Buckler. A., et al., Proc Natl. Acad. So. USA^88, 40^££ -^10^)9 
q13 of human chromosome 11 can be cloned. region near 

(2) Confirmation of the whole structure of the gene 


Pr rJ*lT? ° DNA * determined b V < he Maxam-Gibert method [Maxam. A.M. and Gilbert W 

H ™ ^ Tl 560 0 977)1 " ** did60Xy t6Chnique I Messin 9- J - Nudei c Res., a 309 (rS)] 

It can be confirmed by, for example, the 5'RACE method, the 3'RACE method or the Northern blotting method that
the cDNA obtained by the above-mentioned method contains the full-length protein coding region.

(3) Rec 



The tumor suppressor gene cDNA obtained by the above-mentioned method, or a fragment thereof is inteorated 
as turing his transformant in a conventional manner, a large amount of the tumor suppressor gene product or a fraament 
of a vector suitable for the expression of the cDNA by a known method with the use of restriction enzym^ and MA 

££?2 ?J, f T V T * BI1 ° ^ 1 ' «« V6C,0r SV40 ori 9 ina1in 9 in viTusSgh any vector 
capable of reputing and amplrryng .n the host cells may be used therefor without restriction. Similan/the J m ZZ 

S^'tTL™ T ""T* ' n Mm ^ 3ny SuitaWe may be selected therefor deperS^on 

„ J V™ SUppreSSOf or a fra 9™"« thereof. Any DNA may be used as the cDNA herein so long as itSies for 

fte protein to be expressed is one having a physiological activity of suppressing the proliferation of cancer cells then 
frese^enceofthecDNAisr^re^^ 

ZZSSSSSt has under90ne ^ subsWufon ' ^ insertion " a ^^^^£3 

s» The recombinant expression vector thus obtained is introduced into a host bv for examnio f>» mm , otort ^, 
method [J. Mol Biof.. 53, 154 (1970)]. the protoplast method [Proc. NatilcL £ ust ^SfSKSL™ 
ornate method [Science. 22L 551 (1983)]. the h v«ro Ugfog method^S iSSs*T^ 

SSr a ^S,^r a IT' 0 ^ hOSt T " e ^ fe ""^ eflected at a temperature of from20 to 
45 C wrthin a pH range of from 5 to 8 and. if necessary, under aeration and/or stirring. The tumor suppressor aene orod 

^?r m en ^ ere ? ^ S JT * ^ ^ ^ 9,6 by ^Uery^fog^sepS- 
tionfisolation methods. Examples of these methods include salting out solvent precipitation, dialysis, gel ffltration. 
electrophoresis, ion exchange chromatography, affinity chromatography and reversed phase high performance liquid
chromatography. 

(4) Preparation of antibody 


By using the tumor suppressor gene product or a fragment thereof as an antigen, an antibody is prepared. A polyclonal
antibody is prepared in accordance with a conventional method by, for example, sufficiently immunizing an animal
such as a mouse, guinea pig or rabbit with the antigen by subcutaneously, intramuscularly, intraperitoneally or
intravenously administering it a number of times, sampling the blood from the animal and then separating the serum to
obtain the antibody. A commercially available adjuvant is also usable therefor.

A monoclonal antibody can be prepared by a known method. For example, spleen cells of a mouse immunized with
the antigen described above are fused with commercially available mouse myeloma cells to thereby give hybridomas.
Then the target monoclonal antibody can be prepared from the culture supernatant of the hybridoma or the ascites fluid
of a mouse inoculated with the hybridoma.

It is not necessary that the tumor suppressor gene product to be used as the antigen has the whole amino acid
structure; a peptide having a partial structure thereof, a modified peptide, its derivative or a fused peptide formed by
fusing this peptide with another peptide is also usable. These substances may be prepared by either biological
techniques or chemical synthesis techniques.

These antibodies enable the identification and determination of the peptide of the present invention in human
biological samples and thus are applicable to, for example, diagnostic drugs for diseases to which the polypeptide is
related. The peptide can be immunologically assayed in accordance with any of the known methods including the
fluorescent antibody method, the passive agglutination method and the enzyme-labeled antibody technique.

(5) Gene analysis of human organic tissues 


Examples of the biological sample usable in the gene analysis include normal human tissues, various types of
human tumor tissues, human blood, human bodily fluids and human secretions. The DNA of the employed tissue may
be extracted and prepared by, for example, the method reported by Sato, T., et al. [Cancer Res., 50, 7184 (1990)].

From the DNA sequence provided by the present invention, a partial DNA sequence at an appropriate position is
selected and a synthetic oligonucleotide having this sequence or one complementary thereto is used as a primer, a
probe or a marker. Thus the occurrence of a mutation of this gene in man and the morphology of the mutation can be
analyzed. Furthermore, alterations (insertion, deletion, etc.) of this gene in a sample can also be detected by these
analyses.

The partial DNA sequence may be selected from any part of the DNA sequence of the above-mentioned gene. It is
needless to say that an artificially modified DNA sequence may be used therefor and thus the corresponding gene
mutation can be detected.

The analysis may be effected by, for example, the following method. Namely, primers of two sequences are
selected and the partial sequence between them is amplified by the PCR method. Then the amplified DNA sequence
is directly analyzed. Alternatively, this amplification product is integrated into a plasmid in the same manner as that of
the above-mentioned case and host cells are transformed thereby. After culturing the transformant thus obtained, the
DNA sequence of the clone thus obtained is analyzed. Further, the ligase chain reaction method may be applied to the
amplification [Wu et al., Genomics, 4, 560 - 569 (1989)]. Furthermore, a specific mutation in the above-mentioned gene
in a sample can be detected by using the allele-specific PCR [Ruano and Kidd, Nucleic Acids Research, 17, 8392
(1989)] or the ARMS method [C.R. Newton et al., Nucleic Acids Research, 17, 2503 - 2517 (1989)].
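The selection of two primers and the amplification of the partial sequence between them can be sketched in silico as
follows. This is a minimal illustration under simplifying assumptions: only exact matches are considered, the function
names `reverse_complement` and `in_silico_pcr` are hypothetical, and the two 15-base primers are chosen arbitrarily from
the first 60 bases of SEQ ID NO:1 merely as an example; they are not primers disclosed herein.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def in_silico_pcr(template: str, forward: str, reverse: str):
    """Return the region bounded by the two primers, or None if either primer is absent.

    The forward primer is matched on the given strand; the reverse primer is
    matched through its reverse complement downstream of the forward primer.
    Exact matching only: mismatches and annealing conditions are ignored.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(reverse_site)]


# Bases 1 to 60 of SEQ ID NO:1, used here only as a short example template.
TEMPLATE = "".join(
    "CGTTGCTGTC GAAATGAAGT GCGCGCTGCG ACACCTCCCA GCCCACCGAA CTCCGCCGCC".split()
)

if __name__ == "__main__":
    forward_primer = TEMPLATE[:15]                      # arbitrary example primer
    reverse_primer = reverse_complement(TEMPLATE[-15:])  # arbitrary example primer
    product = in_silico_pcr(TEMPLATE, forward_primer, reverse_primer)
    print(len(product), product)
```

The predicted product can then be compared with the sequence actually determined from a sample in order to locate
insertions, deletions or base substitutions.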

Similarly, a point mutation can be detected by the SSCP method [Orita et al., Proc. Natl. Acad. Sci. USA, 86, 2766
- 2770 (1989); and Genomics, 5, 874 - 879 (1989)] or the RNase-protection method with the use of probes containing
the DNA sequence thus selected or an RNA sequence originating therein. By using these probes, a mutation in the
above-mentioned gene in a sample can be detected by the Southern hybridization method, or an abnormality in the
expression level of this gene in a sample can be examined by the Northern hybridization method.

Escherichia coli DH5aF/pAB1, pFL2 and pCE9 each carrying a plasmid containing the cDNA of this tumor suppressor
gene were deposited with National Institute of Bioscience and Human-Technology, Agency of Industrial Science
and Technology, Ministry of International Trade and Industry under accession numbers FERM P-14127, 14128
and 14129, respectively, on February 8, 1994, and they were changed to international deposition under accession
numbers FERM BP-4923, 4924 and 4925, respectively, on December 9, 1994.

The DNA of the present invention has a structure homologous with those of transcriptional factors, and originates
in the most restricted commonly deleted region on chromosome 11 in MEN1-associated tumors. Therefore, it is
expected that the DNA of the present invention may be a novel tumor suppressor gene. The DNA may be used as a tool
in a gene therapy. Further, the fragment of the DNA may be used in the gene analysis of the DNA and in the diagnosis
of diseases to which the DNA relates.
The polypeptide coded for by the DNA according to the present invention may be used as a reagent for investigation
and in the preparation of an antibody. The antibody may be used in the qualitative or quantitative analysis of the
polypeptide in a biological sample. Thus, it is expected that the antibody may be useful as a novel diagnostic drug.

Brief Description of the Drawings 
meLoVJjjy^ 

deriv2 th'efefr^ 3 " 1 ^ ^ ^ "■** ^ °" e *« th6 d ™ structure of 2FM1 cDNA 

Fig. 3 is a diagram showing the homology of the ZFM1 protein with WT1 or EGR2.

Fig. 4 is a diagram showing the exons of the ZFM1 gene. The exons are represented by 1 to 14. The
domains observed in cDNA are represented by A to H.



To further illustrate the present invention in greater detail and in particular, the following Examples will be given.
However, it is to be understood that the present invention is not restricted to these Examples.

Example 1 Isolation and linkage analysis of cosmid clones specific for chromosome 11

„„,^ e ^ 01 «"*"*• rt ™* based on the linkage with a PYGM (muscfe glycogen phosohorvlase 

Nature. 3££85- 87 (1988)]. Subsequently, it was reported that it existed ina region of 12cM located betolZoiilii 

SZLinCT, hamsterAiuman hybnd cell l.ne corrtammg a single human chromosome 1 1 and screened cosmid 
clones conta-ning a part of the human chromosomal DNA with the use of a whole human DNA as probe TOktoS? 
Am J. Hum. Genet.. 43. 258 - 268 (1991); and Tanigani et al.. Am. J. Hum. Genet 50. 56 - 64 Si U" 

Example 2 Preparation of deletion map of chromosome 11 in MEN1-associated tumors

On the other hand, investigations on the loss of heterozygosity (LOH) of the chromosome 1 1 in MEN1 associate 
« £T! S a ' S ? 1 SU99eS,ed ««» «** in «, e above-meXn^egior^ [ XS^STS 

^9 | J- M i « , -.32L213 - 218(1989);Thakkeretal..N. Engl.J. Med.. 32L 218 - 224 (1989 EtertJ ct'cer 
Res.. 5L 1154- 1157(1991)]. ft has been l^pM^Mtv^nw^aM^^^JZ^:' Tf 

87, 1968 - 1972 (1990)]. The results of the examination on LOH are arranged together ^ffwr^unTof thT^in^ 
« ^^"^uscon.deredmattheMENI gene e*sts in a region J^i^SZ^S^SSSi 

Example 3 Preparation of physical map of 11q13 region

- ---^ 

danf.ed depending upon the capability of each done of being hybridized with a corrmjnaenornTc DNAfe^n^ a~ 
W has been fai^ an that 00111-4,^.11-367.^111^111-2^ 

coill-4. among these cosmid dones, are markers closest to the MEN1 gene fled values- 5 03 and «; 1<» tt.^ZZZ 

H""t Genet. 399-403(199^^ 
ton enzymes in YAC clones 1908F2 and 199A7 isolated by using PYGM aTprcle ft has SSSSSC S7T 
367. among the 6 cosmid dones as described above, is also dose to PYGM ^ 1 " 
Example 4 Isolation of exon sequence from 11q13 region

As described above, cCI11-4 and cCI11-367 are cosmid clones which are closest to the MEN1 gene. Thus, an
attempt was made to isolate exons from these 2 cosmid clones by the exon trapping method [Buckler, A., et al., Proc.
Natl. Acad. Sci. USA, 88, 4005 - 4009 (1991)]. The cosmid DNA was cleaved with BglII or BamHI, or both of these
enzymes, and the fragment thus obtained was linked to the BamHI site of an exon splicing vector pSPL1. Transfection
into COS-7 cells and isolation of exon sequences by the reverse transcription PCR (RT-PCR) were each effected in
accordance with the procedure described in the original paper. Consequently, 3 exon sequences originating in
cCI11-367 were obtained and named S367E1, S367E2 and S367E4, respectively. These exon sequences were 147 bp,
76 bp and 129 bp in size, respectively.

Example 5 Isolation of full-length cDNA

By using S367E4 (i.e., one of the exon sequences obtained in the above Example 4) as a probe, a human cortical
cDNA library was screened. Thus, a clone AB1 carrying a cDNA insert of 1 kb was obtained. With the use of this clone
AB1 as a probe, further, a cDNA clone FL2 was obtained from a human fetal liver cDNA library while cDNA clones CE5,
CE9 and CE16 were obtained from a human cerebellar cDNA library. Then, it was confirmed that each of these clones
could be hybridized with the original cosmid clone cCI11-367 and mapped on the chromosome 11q13 with a hybrid cell
line panel. A sequence constructed by overlapping these cDNA clones with one another at the common parts corresponded
to ZFM1 cDNA of 3200 bp (SEQ ID NO:1). This ZFM1 cDNA contained an open reading frame (ORF) of 1869 bp which
corresponded to a sequence of base Nos. 383 to 2251 in SEQ ID NO:1. Based on the information as will be described
hereinbelow, it has been proved that the sequence of SEQ ID NO:1 and that of each clone can be regarded as being
composed of 6 domains A (base Nos. 1 to 413 in SEQ ID NO:1), B (base Nos. 414 to 542 in SEQ ID NO:1), C (base
Nos. 543 to 618 in SEQ ID NO:1), D (base Nos. 619 to 1964 in SEQ ID NO:1), E (base Nos. 1965 to 2218 in SEQ ID
NO:1) and F (base Nos. 2219 to 3200 in SEQ ID NO:1) and domains G and H which are completely different therefrom.
Namely, the exon sequences S367E2 and S367E4 obtained in the above Example 4 corresponded respectively to the
domains C and B. The cDNA clone CE5 lacked the domain E consisting of 254 base pairs corresponding to a sequence
of base Nos. 1965 to 2218 in SEQ ID NO:1, which may be due to alternative splicing. The cDNA clone AB1 contained
the domains A and B and the different one G but not the domains C, D, E and F. The cDNA clone CE16 consisted of
the domains D and E and the different one H (see Fig. 2).
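The base numbers given in this Example can be cross-checked by simple arithmetic. The following Python sketch is
provided only as an illustration; the names `DOMAINS`, `ORF` and `span_length` are hypothetical, while the coordinates
are those stated above for SEQ ID NO:1.

```python
# Domain boundaries (1-based, inclusive) as stated for SEQ ID NO:1.
DOMAINS = {
    "A": (1, 413),
    "B": (414, 542),
    "C": (543, 618),
    "D": (619, 1964),
    "E": (1965, 2218),
    "F": (2219, 3200),
}

# Open reading frame as stated in the text (base Nos. 383 to 2251).
ORF = (383, 2251)


def span_length(span):
    """Length in bases of a 1-based inclusive (start, end) span."""
    start, end = span
    return end - start + 1


if __name__ == "__main__":
    for name, span in DOMAINS.items():
        print(f"domain {name}: {span_length(span)} bp")
    print("total:", sum(span_length(s) for s in DOMAINS.values()), "bp")
    print("ORF:", span_length(ORF), "bp =", span_length(ORF) // 3, "codons")
```

Under these coordinates the six domains sum to the 3200 bp of the cDNA, the ORF of 1869 bp corresponds to 623 codons,
and the domains B and C have the same sizes (129 bp and 76 bp) as the exon sequences S367E4 and S367E2 of Example 4.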

Example 6 Characteristics of the structure of protein coded for by the tumor suppressor gene

A protein coded for by ZFM1 cDNA consisted of 623 amino acid residues and had a nuclear localizing signal
containing basic amino acids in the N-terminal side. Further, a sequence C-X2-C-X4-H-X4-C (amino acid Nos. 279 - 292)
had characteristics of a zinc finger motif existing in a DNA binding protein. 118 proline residues were contained in this
ZFM1 protein. In particular, 69 proline residues were contained in a region of amino acid Nos. 420 to 623 thereof. The
sequence of this region showed high homologies with Wilms' tumor suppressor gene product (WT1) [Gessler et al.,
Nature, 342, 774 - 778 (1990)] and early growth response 2 (EGR2) protein as a transcriptional factor (27.3% and
24.0%, respectively) (see Fig. 3). WT1 is a transcription factor having a Kruppel-like zinc finger motif [Rosenberg et al.,
Nature, 319, 336 - 339 (1986)]. EGR2 is a human homologue of an early growth response gene Krox-20 [Chavrier et al.,
EMBO J., 7, 29 - 35 (1988)] which is expressed at the G0-G1 junction in the cell cycle of cultured mouse cells and it is
also a transcriptional factor [Joseph et al., Proc. Natl. Acad. Sci. USA, 85, 7164 - 7168 (1988)]. The ZFM1 protein
further had 7 proline repetitive sequences (each consisting of at least 4 proline residues located continuously) in the
C-terminal side. One of these repetitive sequences followed a glutamine repetitive sequence and thus formed a structure
which was almost the same as that of the hinge domain of a mineralocorticoid receptor [Arriza et al., Science, 237, 268
- 275 (1987)]. Such a hinge structure is essentially required in the communication between a hormone binding domain
and a DNA binding domain [Krust et al., EMBO J., 5, 891 - 897 (1986); and Giguere et al., Cell, 46, 645 - 652 (1986)].
Further, mRNAs of a number of types originating in the ZFM1 gene were expressed in hormone-producing organs such
as pancreas, thyroid, adrenal gland and ovarium (see Table 1 in Example 8).
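The structural features described above (a C-X2-C-X4-H-X4-C motif, the number of proline residues in a region, and
runs of at least 4 continuous proline residues) can each be searched for in a one-letter amino acid sequence. The
following Python sketch is only an illustration: the function names are hypothetical, and the short sequences used in
it are artificial test strings, not the ZFM1 sequence.

```python
import re

# C-X2-C-X4-H-X4-C written as a regular expression over one-letter codes.
ZINC_FINGER = re.compile(r"C.{2}C.{4}H.{4}C")
# A proline repeat as defined in the text: at least 4 continuous prolines.
PROLINE_RUN = re.compile(r"P{4,}")


def find_zinc_finger(protein: str):
    """Return (start, end) positions (1-based, inclusive) of each motif match."""
    return [(m.start() + 1, m.end()) for m in ZINC_FINGER.finditer(protein)]


def count_in_region(protein: str, residue: str, start: int, end: int) -> int:
    """Count a residue within positions start..end (1-based, inclusive)."""
    return protein[start - 1:end].count(residue)


def count_proline_runs(protein: str) -> int:
    """Count non-overlapping runs of at least 4 continuous proline residues."""
    return len(PROLINE_RUN.findall(protein))


if __name__ == "__main__":
    # Artificial sequence built so that the motif starts at position 4.
    toy = "AAA" + "C" + "TK" + "C" + "GGAG" + "H" + "IMAS" + "C" + "QQ"
    print(find_zinc_finger(toy))
    print(count_proline_runs("APPPPAPPPQPPPPP"))
```

A motif match spans 14 residues, which agrees with the range of amino acid Nos. 279 to 292 given above.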

These facts indicate that the ZFM1 protein is the product of a tumor suppressor gene, is localized in the nuclei and
exerts its function by binding to DNA and thus suppressing the proliferation of cells, and that ZFM1 is a gene which
participates in the onset of MEN1.

Example 7 Structure of genomic gene

Based on the cosmid clone containing the ZFM1 gene, the genomic structure of the ZFM1 gene was determined.
The ZFM1 gene existed over a region of about 20 kb in the genomic DNA and consisted of 14 exons (see Fig. 4). As
Fig. 4 shows, it has been revealed that these exons (Nos. 1 to 14) and the domains A to H described in the above
Example 5 relate to each other as follows: domain A = exon 1, domain B = exon 2, domain C = exon 3, domain D =
exons 4, 5, 6, 7, 8, 9, 10, 11 and 12, domain E = exon 13 and a part of exon 14, domain F = a part of exon 14, domain G
= exon 2a, and domain H = exon 3a. On the other hand, the domain G of the cDNA clone consisting of the domains
A-B-G is coded for by the exon 2a which directly follows the exon 2 coding for the domain B, and the domain H of the
cDNA clone consisting of the domains H-D-E is coded for by the exon 3a which is located immediately before the exon 4
coding for the domain D.
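The correspondence between the domains and the exons can be cross-checked against the exon locations given in the
feature table of the sequence listing for SEQ ID NO:1. The following Python sketch is only an illustration; the names
`EXONS`, `DOMAINS`, `exons_overlapping` and `is_contiguous` are hypothetical. The exons 2a and 3a (domains G and H) are
omitted because they are not contained in SEQ ID NO:1.

```python
# Exon locations (1-based, inclusive) as listed in the feature table of SEQ ID NO:1.
EXONS = {
    1: (1, 413), 2: (414, 542), 3: (543, 618), 4: (619, 771),
    5: (772, 861), 6: (862, 1045), 7: (1046, 1161), 8: (1162, 1269),
    9: (1270, 1450), 10: (1451, 1724), 11: (1725, 1784), 12: (1785, 1964),
    13: (1965, 2137), 14: (2138, 3132),
}

# Domain boundaries as stated in Example 5.
DOMAINS = {
    "A": (1, 413), "B": (414, 542), "C": (543, 618),
    "D": (619, 1964), "E": (1965, 2218), "F": (2219, 3200),
}


def exons_overlapping(span):
    """Return the numbers of the exons that overlap a (start, end) span."""
    low, high = span
    return [n for n, (start, end) in sorted(EXONS.items())
            if start <= high and end >= low]


def is_contiguous(exons) -> bool:
    """True if each exon starts at the base following the end of the previous one."""
    spans = [exons[n] for n in sorted(exons)]
    return all(nxt[0] == cur[1] + 1 for cur, nxt in zip(spans, spans[1:]))


if __name__ == "__main__":
    for name, span in DOMAINS.items():
        print(f"domain {name}: exons {exons_overlapping(span)}")
    print("exons contiguous in cDNA:", is_contiguous(EXONS))
```

With these coordinates the domain D overlaps the exons 4 to 12, the domain E overlaps the exon 13 and a part of the
exon 14, and the domain F lies within the exon 14, in agreement with the relation described above.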

Example 8 Expression of ZFM1 gene in human tissues 

5 « is considered that the larger mRNA coaes^toSS^^^*^ h a" 0 ' these tissues, 
the smaOer mRNA corresponds to one conSng *e Z! M^^T"? 1,16 d0nlainS «*te 
the expression of the ZFM1 gene in V JZ^£Z£ 22^1^ 2 >- * examine 

extracting FtNAs from various huma TtissuJ > S'S^L ^7^^°" (RT " PCR) ""^ was ^ected by 
respective domains. As a result the e^oTcT^ZuTn, ( ^ ' m R * 2 > ***** for the 

• differences in splicing, were dbLJ^Mr^l S t $ ^ ""i* were *» to 

^D.A-B^and^fseel^^^^^ 
SEQUENCE LISTING
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3200 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens
(ix) FEATURE:

(A) FEATURE KEY: 5'UTR
(B) LOCATION: 1..382

(A) FEATURE KEY: CDS
(B) LOCATION: 383..2254

(A) FEATURE KEY: exon 1
(B) LOCATION: 1..413

(A) FEATURE KEY: exon 2
(B) LOCATION: 414..542

(A) FEATURE KEY: exon 3
(B) LOCATION: 543..618

(A) FEATURE KEY: exon 4
(B) LOCATION: 619..771

(A) FEATURE KEY: exon 5
(B) LOCATION: 772..861

(A) FEATURE KEY: exon 6
(B) LOCATION: 862..1045

(A) FEATURE KEY: exon 7
(B) LOCATION: 1046..1161

(A) FEATURE KEY: exon 8
(B) LOCATION: 1162..1269

(A) FEATURE KEY: exon 9
(B) LOCATION: 1270..1450

(A) FEATURE KEY: exon 10
(B) LOCATION: 1451..1724

(A) FEATURE KEY: exon 11
(B) LOCATION: 1725..1784

(A) FEATURE KEY: exon 12
(B) LOCATION: 1785..1964

(A) FEATURE KEY: exon 13
(B) LOCATION: 1965..2137

(A) FEATURE KEY: exon 14
(B) LOCATION: 2138..3132

(A) FEATURE KEY: 3'UTR
(B) LOCATION: 2280..3200
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGTTGCTGTC GAAATGAAGT GCGCGCTGCG ACACCTCCCA GCCCACCGAA CTCCGCCGCC 60 
ATTTCCTCGC TTGCCTAACG GTTCGGCCAA TCCCAGCGCG CATCAATGCC GGACTGAGGC 120 
TCCGCCAATC GGAGGCCGCC GATTTCGACC CTTCGCCTCG GCCCGGCCCA ATCCATTCCC 180 
CGGCCCCGCC GCCCCCGGCC CGCCCCCGCG GTGCCCTCTC TCCTCCCTCT TTGTGCGTCT 240 
CGCGCCGCCG CCGCCCGCCG CGTGAGAGGA CGGGCTCCGC GCGCTCCGGC AGCGCATTCG 300
GGTCCCCTCC CCCCGGGAGG CTTGCGAAGG AGAAGCCGCC GCAGAGGAAA AGCAGGTGCC 360
GGTGCCTGTC CCCGGGGGCG CC ATG GCG ACC GGA GCG AAC GCC ACG CCG TTG 412
                        Met Ala Thr Gly Ala Asn Ala Thr Pro Leu
                        1               5                   10

GAC TTC CCA AGT AAG AAG CGG AAG AGG AGC CGC TGG AAC CAA GAC ACA 460
Asp Phe Pro Ser Lys Lys Arg Lys Arg Ser Arg Trp Asn Gln Asp Thr
                15                  20                  25

ATG GAA CAG CCG ACA GTG ATT CCA GGA ATG CCT ACA GTT ATT CCC CCT 508
Met Glu Gln Pro Thr Val Ile Pro Gly Met Pro Thr Val Ile Pro Pro
            30                  35                  40

GGA CTT ACT CGA GAA CAA GAA AGA GCT TAT ATA GTG CAA CTG CAG ATA 556
Gly Leu Thr Arg Glu Gln Glu Arg Ala Tyr Ile Val Gln Leu Gln Ile
        45                  50                  55

GAA GAC CTG ACT CGT AAA CTG CGC ACA GGG GAC CTG GGC ATC CCC CCT 604
Glu Asp Leu Thr Arg Lys Leu Arg Thr Gly Asp Leu Gly Ile Pro Pro
    60                  65                  70

AAC CCT GAG GAC AGG TCC CCT TCC CCT GAG CCC ATC TAC AAT AGC GAG 652
Asn Pro Glu Asp Arg Ser Pro Ser Pro Glu Pro Ile Tyr Asn Ser Glu
75                  80                  85                  90

GGG AAG CGG CTT AAC ACC CGA GAG TTC CGC ACC CGC AAA AAG CTG GAA 700
Gly Lys Arg Leu Asn Thr Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu
                95                  100                 105
GAG GAG
Glu Glu 

TTC AAG 
Phe Lys 

AAA GTC 
Lys Val 
140 
CTG CTC 
Leu Leu 
155 

TGC AAT 
Cys Asn 

AAG GTT 
Lys Val 

CTT CAT 
Leu His 

GTG GAA 
Val Glu 
220 



CGG CAC AAC CTC 
Arg His Asn Leu 
110 

CCA CCT GCA GAT 
Pro Pro Ala Asp 
125 

ATG ATT CCA CAA 
Met He Pro Gin 



ATC GGG 
He Gly 

GCC AAG 
Ala Lys 

GGG CGC 
Gly Arg 
190 
GCC CTG 
Ala Leu 
205 

CAG ATA 
Gin He 



CCC AGA 
Pro Arg 
160 
ATT ATG 
He Met 
175 

AAA GAT 
Lys Asp 

GTT ACT 
Val Thr 

AGA AAC 
Arg Asn 



ATC ACA 
He Thr 

TAC AAA 
Tyr Lys 
130 
GAT GAG 
Asp Glu 
145 

GGG AAC 
Gly Asn 

ATC CGG 
He Arg 

GGC CAG 
Gly Gin 

GCC AAT 
Ala Asn 
210 
ATC CTG 
He Leu 



GAG ATG 
Glu Met 
115 

CCT CCA 
Pro Pro 

TAC CCA 
Tyr Pro 

ACC CTG 
Thr Leu 

GGG AAA 
Gly Lys 
180 
ATG TTG 
Met Leu 
195 

ACA ATG 
Thr Met 

AAG CAG 
Lys Gin 



GTT GCA 
Val Ala 

GCA ACA 
Ala Thr 

GAA ATC 
Glu He 
150 
AAG AAC 
Lys Asn 
165 

GGG TCT 
Gly Ser 

CCA GGA 
Pro Gly 

GAG AAC 
Glu Asn 

GGT ATC 
Gly He 
230 



CTC AAT CCG GAT 
Leu Asn Pro Asp 
120 

CGT GTG AGT GAT 
Arg Val Ser Asp 
135 

AAC TTT GTG GGG 
Asn Phe Val Gly 

ATA GAG AAG GAG 
He Glu Lys Glu 
170 

GTG AAA GAA GGG 
Val Lys Glu Gly 
185 

GAA GAT GAG CCA 
Glu Asp Glu Pro 
200 

GTC AAA AAG GCA 
Val Lys^Lxs Ala 
215 

GAG ACT CCA GAG 
Glu Thr Pro Glu 
GAC CAG AAT GAT
Asp Gin Asn Asp 
235 

AAT GGG ACC CTT 
Asn Gly Thr Leu 



AGC TCA 
Ser Ser 



GGA GGG 
Gly Gly 



Asp Pro 
300 
TCC CTC 
Ser Leu 
315 

TCC ACC 
Ser Thr 



GGG ACC 
Gly Thr 
270 
GCT GGC 
Ala Gly 
285 

CAG TCA 
Gin Ser 



CTA CGG 
Leu Arg 
240 
CGG GAA 
Arg Glu 
255 

CGC AGC 
Arg Ser 



AAG ATG CAG CTT 
Lys Met Gin Leu 



CGG GAG 
Arg Glu 



GAC GAT 
Asp Asp 



ATT ACC 
He Thr 



CAC ATT 
His He 



GCT CAG 
Ala Gin 



ATG GCT 
Met Ala 



TCT GGG 
Ser Gly 



GCT GCT CCC GCC 
Ala Ala Pro Ala 



GAA CTG 
Glu Leu 
320 
CCT GCC 
Pro Ala 
335 

AAC AAC 
Asn Asn 



GCT TCA 
Ala Ser 
290 
GAT AAA 
Asp Lys 
305 

GGT GAA 
Gly Glu 



AAC AGG 
Asn Arg 
260 
AAC ACC 
Asn Thr 
275 

GAC TGT 
Asp Cys 



ATC TTA 
lie Leu 



ACA GTG 
Thr Val 



AAA TTC 
Lys Phe 



GCA CGG 
Ala Arg 



GCA CCT 
Ala Pro 



ACC ACA CCC CTG 
Thr Thr Pro Leu 
340 

CCA CCT CCA CCG 
Pro Pro Pro Pro 
355 



ATG GAT 
Met Asp 
310 
GTC CCA 
Val Pro 
325 

GCC AGC 
Ala Ser 

TCT CTC 
Ser Leu 



TTG GCT CGC TTA 
Leu Ala Arg Leu 
250 

AGA CCC TGG CAG 
Arg Pro Trp Gin 
265 

TGT ACC AAG TGT 
Cys Thr Lys Cys 
280 

CAA AGG CCT GGT 
Gin Arg Pro Gly 
295 

AAA GAA TAT TTG 
Lys Glu Tyr Leu 

GCA TCT GTG GGC 
Ala Ser Val Gly 
330 

GCA CCT CGT CCT 
Ala Pro Arg Pro 
345 

ATG TCT ACC ACC 
Met Ser Thr Thr 
360 






EP0 727 486 A2 



CAG AGC CGC CCA CCC TGG ATG AAT TCT GGT CCT TCA GAG AGT TGG CCC 1516 
5 Gin Ser Arg Pro Pro Trp Met Asn Ser Gly Pro Ser Glu Ser Trp Pro 

365 370 375 

w TAC CAC GGC ATG CAT GGA GGT GGT CCT GGT GGG CCC GGA GGT GGC CCC 1564 
Tyr His Gly Met His Gly Gly Gly Pro Gly Gly Pro Gly Gly Gly Pro 

380 385 390 

CAC AGC TTC CCA CAC CCA TTA CCC AGC CTG ACA GGT GGG CAT GGT GGA 1612 
His Ser Phe Pro His Pro Leu Pro Ser Leu Thr Gly Gly His Gly Gly 
20 395 400 405 410 

CAT CCC ATG CAG CAC AAC CCC AAT GGA CCC CCA CCC CCT TGG ATG CAG 1660 
His Pro Met Gin His Asn Pro Asn Gly Pro Pro Pro Pro Trp Met Gin 

ss 

415 420 425 

CCA CCA CCA CCA CCG ATG AAC CAG GGC CCC CAC CCT CCT GGG CAC CAT 1708 
30 Pro Pro Pro Pro Pro Met Asn Gin Gly Pro His Pro Pro Gly His His 
430 435 440 

GGC CCT CCT CCA ATG GAT CAG TAC CTG GGA AGT ACG CCT GTG GGC TCT 1756 

35 

Gly Pro Pro Pro Met Asp Gin Tyr Leu Gly Ser Thr Pro Val Gly Ser 
445 450 455 

40 GGG GTC TAT CGC CTG CAT CAA GGA AAA GGT ATG ATG CCG CCA CCA CCT 1804 
Gly Val Tyr Arg Leu His Gin Gly Lys Gly Met Met Pro Pro*Pro Pro 

„ 460 465 470 

ATG GGC ATG ATG CCG CCG CCG CCG CCG CCT CCC AGT GGG CAG CCC CCA 1852 
Met Gly Met Met Pro Pro Pro Pro Pro Pro Pro Ser Gly Gin Pro Pro 
475 480 485' 490 
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CCC CCT CCC TCT GGT CCT CTT CCC CCA TGG CAA CAA CAG CAG CAG CAG 1900 
Pro Pro Pro Ser Gly Pro Leu Pro Pro Trp Gin Gin Gin Gin Gin Gin 

495 500 505 

CCT CCG CCA CCC CCT CCG CCC AGC AGC AGT ATG GCT TCC AGT ACC CCC 1948 
Pro Pro Pro Pro Pro Pro Pro Ser Ser Ser Met Ala Ser Ser Thr Pro 

510 515 520 

TTG CCA TGG CAG CAA AAT ACG ACG ACT ACC ACC ACG AGC GCT GGC ACA 1996 
Leu Pro Trp Gin Gin Asn Thr Thr Thr Thr Thr Thr Ser Ala Gly Thr 

525 530 535 

GGG TCC ATC CCG CCA TGG CAA CAG CAG CAG GCG GCT GCC GCA GCT TCT 2044 
Gly Ser He Pro Pro Trp Gin Gin Gin Gin Ala Ala Ala Ala Ala Ser 

540 545 550 

CCA GGA GCC CCT CAG ATG CAA GGC AAC CCC ACT ATG GTG CCC CTG CCC 2092 
Pro Gly Ala Pro Gin Met Gin Gly Asn Pro Thr Met Val Pro Leu Pro 
555 560 565 570 

CCC GGG GTC CAG CCG CCT CTG CCG CCT GGG GCC CCT CCC CCT CCG CCC 2140 
Pro Gly Val Gin Pro Pro Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro 

575 580 585 

CGT. AGC ATC GAG TGT CTT CTT TGT CTT CTT TCT CTC CTC ACC CAA CTC 2188 
\rg Ser He Glu Cys Leu Leu Cys Leu Leu Ser Leu Leu Thr GLn_Leu 

590 595 600 

CCT TTG CCT CTC CCC AAA CCG GGC CGC CAG GAT CCC TCC CCG CGG CGG 2236 
Pro Leu Pro Leu Pro Lys Pro Gly Arg Gln Asp Pro Ser Pro Arg Arg
605 610 615

CGA TGG CCC GAG CCA TGAGAGTGAG GACTTTCCGC GCCCATTGGT GACCCTTCCA 2291
Arg Trp Pro Glu Pro
620 623

GGCAGACAGC CTCAGCAACG CCCCTGGTGG ACAGGATGGT TCGGCAAAGC AGCCTGAGTT 2351
ATTTTTGTGG ACGGAATCGG AACACGCTGG CTCCATATCG TGAAATTTTT ATTAATTTTT 2411
TTCTTTTTCC TTTGTTACTT CTTTATCTTT TCCTTTCTTC AGACTCCGTC CAAGGAGATG 2471
CTCTCCCCGG TCTTCTGCTG CAATTTAGAT TCCTTTGGGT TCTCTCCAGT TCTCCTTCCC 2531
TTACCAAGGA GAGGGGAGCA AATGGTTTTG GGCAAGGGCT TTGGCCATTC ATGTCAAGCT 2591
GGTTGTGGGT TTTTCAAGGT GCCATAGCCA CCCCCAAATA TGTTTGTTTA AAGCGTGGGG 2651
TTTTTTAATC TCTGCCACCC TTGTCAAGGG AGTCTTGTAA AGTTGCCGAG GGTAGGTTCA 2711
TCTCCAGGTT TCGGGATTCC CATCCGTCCT GGCGATCCTG CCAGCAGTGG GTGGGCAGCC 2771
TGAGCTCCCT CGGGCTCGCC TGCCAGCCTG GAGTTCTTCC TGTGCTCCTT GATCACCTGA 2831
GCTGCCTCAG ATTCCATTTG GTCCTCTCCT TCCTGGAAGG CTTCCTTTTA TGTTTTGTTT 2891
TAATCCCAAA TGTCTGAATG TTTTGCAGTG TGTAGGGGTT TGAGCCCCTT GTTCATTCTC 2951
CTTCCTTTTT CCTCCCGCTT CCCTCTCCAT GAAGTGATTC TGTTGACAAT AATGTATACT 3011
GCGCGTTCTC TTCACTGGTT TATCTGCAGA AATTTCTCTG GGCTTTTTTC GGTGTTAGAT 3071
TCAACACTGC GCTAAAGCGG GGATGTTCCA TTGAATAAAA GAGCAGTGTG GTTTTCTGGG 3131
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3191
AAAAAAAAA 3200
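
The codon-by-codon translation printed under the nucleotide rows can be spot-checked mechanically. The following is a minimal sketch, not part of the original disclosure: the helper name `translate` and the deliberately partial codon table (only the codons that occur in the last two coding rows, taken from the standard genetic code) are assumptions made for illustration.

```python
# Partial standard genetic code: only the codons needed for the rows checked
# below (an illustrative subset, not a complete table).
CODON_TABLE = {
    "CCT": "Pro", "TTG": "Leu", "CTC": "Leu", "CCC": "Pro", "AAA": "Lys",
    "CCG": "Pro", "GGC": "Gly", "CGC": "Arg", "CAG": "Gln", "GAT": "Asp",
    "TCC": "Ser", "CGG": "Arg", "CGA": "Arg", "TGG": "Trp", "GAG": "Glu",
    "CCA": "Pro", "TGA": "Stop",
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "Stop":
            break
        residues.append(amino_acid)
    return residues


# Last two coding rows as printed above (residues 603 to 623), followed by
# the TGA stop codon.
tail = ("CCT TTG CCT CTC CCC AAA CCG GGC CGC CAG GAT CCC TCC CCG CGG CGG "
        "CGA TGG CCC GAG CCA TGA")
peptide = translate(tail)
print(" ".join(peptide))
print(len(peptide))
```

Under these assumptions the translated tail should end in Arg Trp Pro Glu Pro, in agreement with the final row of the table, where the last residue is numbered 623.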
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: Cancer Institute 

(B) STREET: 37-1, Kamiikebukuro 1-chome,

(C) CITY: Toshima-ku, Tokyo 
(E) COUNTRY: Japan

(F) POSTAL CODE (ZIP): none

(A) NAME: Eisai Co., Ltd.

(B) STREET: 6-10, Koishikawa 4-chome,

(C) CITY: Bunkyo-ku, Tokyo 

(E) COUNTRY: JAPAN 

(F) POSTAL CODE (ZIP) : 112 

(ii) TITLE OF INVENTION: Tumor Suppressor Gene 
(iii) NUMBER OF SEQUENCES: 2

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..382



(B) LOCATION: 1..413

(ix) FEATURE:

(A) NAME/KEY: exon 2

(B) LOCATION: 414..542

(ix) FEATURE:

(A) NAME/KEY: exon 3

(B) LOCATION: 543..618

(ix) FEATURE:

(A) NAME/KEY: exon 4

(B) LOCATION: 619..771

(ix) FEATURE:

(A) NAME/KEY: exon 5

(B) LOCATION: 772..861

(ix) FEATURE:

(A) NAME/KEY: exon 6

(B) LOCATION: 862..1045



(ix) FEATURE:

(A) NAME/KEY: exon 9

(B) LOCATION: 1270..1450

(ix) FEATURE:

(A) NAME/KEY: exon 10

(B) LOCATION: 1451..1724

(ix) FEATURE:

(A) NAME/KEY: exon 11

(B) LOCATION: 1725..1784

(ix) FEATURE:

(A) NAME/KEY: exon 12

(B) LOCATION: 1785..1964
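
The feature coordinates above fix how residue numbers map onto nucleotide positions: with a 5'UTR of 1..382, the codon for residue n occupies positions 382 + 3(n - 1) + 1 through 382 + 3n. A minimal sketch of that arithmetic follows. The names `codon_span` and `exon_of` are hypothetical helpers introduced here for illustration, and the exon dictionary contains only the entries that survive in this copy of the listing (the entry with location 1..413 has lost its NAME/KEY line, and exons 7 and 8 are absent, so all three are left out).

```python
UTR5_END = 382  # the 5'UTR spans 1..382 according to the feature table


def codon_span(residue):
    """Return the 1-based (start, end) nucleotide positions of a residue's codon."""
    start = UTR5_END + 3 * (residue - 1) + 1
    return start, start + 2


# Exon boundaries exactly as listed; missing or unlabeled entries are omitted.
EXONS = {
    2: (414, 542), 3: (543, 618), 4: (619, 771), 5: (772, 861),
    6: (862, 1045), 9: (1270, 1450), 10: (1451, 1724),
    11: (1725, 1784), 12: (1785, 1964),
}


def exon_of(position):
    """Return the listed exon number containing a position, or None if unlisted."""
    for number, (start, end) in EXONS.items():
        if start <= position <= end:
            return number
    return None


print(codon_span(1))             # codon of the initiator Met
print(codon_span(623))           # codon of the last residue of SEQ ID NO:2
print(exon_of(codon_span(458)[1]))
```

As a consistency check, the codon for residue 458 ends at position 1756, which is the row-end number printed beside the corresponding nucleotide row of the translation table.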



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CGTTGCTGTC GAAATGAAGT GCGCGCTGCG ACACCTCCCA GCCCACCGAA CTCCGCCGCC 
ATTTCCTCGC TTGCCTAACG GTTCGGCCAA TCCCAGCGCG CATCAATGCC GGACTGAGGC 
TCCGCCAATC GGAGGCCGCC GATTTCGACC CTTCGCCTCG GCCCGGCCCA ATCCATTCCC 
CGGCCCCGCC GCCCCCGGCC CGCCCCCGCG GTGCCCTCTC TCCTCCCTCT TTGTGCGTCT 
CGCGCCGCCG CCGCCCGCCG CGTGAGAGGA CGGGCTCCGC GCGCTCCGGC AGCGCATWG 
GGTCCCCTCC CCCCGGGAGG CTTGCGAAGG AGAAGCCGCC GCAGAGGAAA AGCAGGTGCC 



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ATG GAA CAG CCG ACA GTG ATT C 



45 50 55 

GAA GAC CTG ACT CCT AAA CTG CGC ACA GGG GAC err cuzi- „„„ 

Glu Asp Leu Thr Arg Lys Leu Arg Thr Gly Asp Leu Gly Ile Pro Pro
60 65 70

AAC CCT GAG GAC AGG TCC CCT TCC CCT GAG CCC ATC TAC AAT AGC GAG 

Asn Pro Glu Asp Arg Ser Pro Ser Pro Glu Pro Ile Tyr Asn Ser Glu

75 80 85 90

GGG AAG CGG CTT AAC ACC CGA GAG TTC CGC ACC CGC AAA AAG CTG GAA 700
Gly Lys Arg Leu Asn Thr Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu
95 100 105 

5^ ™ G = GG AAC CTC ATC ACA GAG ATG GTT GCA CTC AAT CCG GAT 748 

Glu Glu Arg His Asn Leu Ile Thr Glu Met Val Ala Leu Asn Pro Asp

110 115 120 

TTC AAG CCA CCT GCA GAT TAC AAA CCT CCA GCA ACA CGT GTG ACT GAT 796 
Phe Lys Pro Pro Ala Asp Tyr Lys Pro Pro Ala Thr Arg Val Ser Asp
125 130 135

AAA GTC ATG ATT CCA CAA GAT GAG TAC CCA GAA ATC AAC TTT GTG GGG 844

Lys Val Met Ile Pro Gln Asp Glu Tyr Pro Glu Ile Asn Phe Val Gly

£u ^ ^ = CC AGA GGG ACC CTG MG ^ ATA GAG AAG GAG 892 
Leu Leu Ile Gly Pro Arg Gly Asn Thr Leu Lys Asn Ile Glu Lys Glu



Cys Asn Ala Lys Ile Met Ile Arg Gly Lys Gly Ser Val Lys Glu Gly

AAG GTT GGG CGC AAA GAT GGC CAG ATG TTG CCA GGA GAA GAT GAG CCA 
Lys Val Gly Arg Lys Asp Gly Gln Met Leu Pro Gly Glu Asp Glu Pro

190 195 200 

CTT CAT GCC CTG GTT ACT GCC AAT ACA ATG GAG AAC GTC AAA AAG rra 
Leu His Ala Leu Val Thr Ala Asn Thr Met Glu Asn Val Lys Lys Ala

Vkl Gl*u T? A MC CTG 00 061 ATC GAG ACT CCA GAG 

Val Glu Gln Ile Arg Asn Ile Leu Lys Gln Gly Ile Glu Thr Pro Glu
220 225 230

GAC CAG AAT GAT CTA CGG AAG ATG CAG CTT CGG GAG TTG GCT CGC TTA
Asp Gln Asn Asp Leu Arg Lys Met Gln Leu Arg Glu Leu Ala Arg Leu
235 240 245

AAT GGG ACC CTT CGG GAA GAC GAT AAC AGG ATC TTA AGA CCC TGG CAG 
Asn Gly Thr Leu Arg Glu Asp Asp Asn Arg Ile Leu Arg Pro Trp Gln

255 260 265

AGC TCA GGG ACC CGC AGC ATT ACC AAC ACC ACA GTG TGT ACC AAG TGT 
Ser Ser Gly Thr Arg Ser Ile Thr Asn Thr Thr Val Cys Thr Lys Cys

270 275 280 

GGA GGG GCT GGC CAC ATT GCT TCA GAC TGT AAA TTC CAA AGG CCT GGT
Gly Gly Ala Gly His Ile Ala Ser Asp Cys Lys Phe Gln Arg Pro Gly



Z TCT GGG CCT GCC A 
c Ser Gly Pro Ala T 



\ CCC CTG GCC AGC G 



r TCT GGT CCT TCA G 
n Ser Gly Pro Ser G 



\ CAC CCA TTA CCC A 
3 His Pro Leu Pro S 



3 CAC AAC CCC AAT G 
i His Asn Pro Asn G 



T ACG CCT GTG GGC TCT 
r Thr Pro Val Gly Ser 



T CCC TCT GGT CCT C 



TTG CCA TGG CAG CAA AAT

CCA GGA GCC CCT CAG ATG CAA GGC AAC CCC ACT ATG GTG CCC CTG CCC

Pro Gly Ala Pro Gln Met Gln Gly Asn Pro Thr Met Val Pro Leu Pro
555 560 565 570

CCC GGG GTC CAG CCG CCT CTG CCG CCT GGG GCC CCT CCC CCT CCG CCC
Pro Gly Val Gln Pro Pro Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro
575 580 585

CGT AGC ATC GAG TGT CTT CTT TGT CTT CTT TCT CTC CTC ACC CAA CTC

Arg Ser Ile Glu Cys Leu Leu Cys Leu Leu Ser Leu Leu Thr Gln Leu

CCT TTG CCT CTC CCC AAA CCG GGC CGC CAG GAT CCC TCC CCG CGG CGG
Pro Leu Pro Leu Pro Lys Pro Gly Arg Gln Asp Pro Ser Pro Arg Arg
605 610 615

CGA TGG CCC GAG CCA TGAGAGTGAG GACTTTCCGC GCCCATTGGT GAC
Arg Trp Pro Glu Pro
620
CCTTCCAGGC AGACAGCCTC AGCAACGCCC CTGGTGGACA GGATGGTTCG GCAAAGCAGC 2344
CTGAGTTATT TTTGTGGACG GAATCGGAAC ACGCTGGCTC CATATCGTGA AATTTTTATT 2404
AATTTTTTTC TTTTTCCTTT GTTACTTCTT TATCTTTTCC TTTCTTCAGA CTCCGTCCAA 2464
GGAGATGCTC TCCCCGGTCT TCTGCTGCAA TTTAGATTCC TTTGGGTTCT CTCCACTTCT 2524
CCTTCCCTTA CCAAGGAGAG GGGAGCAAAT GGTTTTGGGC AAGGGCTTTG GCCATTCATG 2584
TCAAGCTGGT TGTGGGTTTT TCAAGGTGCC ATAGCCACCC CCAAATATGT TTGTTTAAAG 2644
CGTGGGGTTT TTTAATCTCT GCCACCCTTG TCAAGGGAGT CTTGTAAAGT TGCCGAGGGT 2704
AGGTTCATCT CCAGGTTTCG GGATTCCCAT CCGTCCTGGC GATCCTGCCA GCAGTGGGTG 2764
GGCAGCCTGA GCTCCCTCGG GCTCGCCTGC CAGCCTGGAG TTCTTCCTGT GCTCCTTGAT 2824
CACCTGAGCT GCCTCAGATT CCATTTGGTC CTCTCCTTCC TGGAAGGCTT CCTTTTATGT 2884
TTTGTTTTAA TCCCAAATGT CTGAATGTTT TGCAGTGTGT AGGGGTTTGA GCCCCTTGTT 2944
CATTCTCCTT CCTTTTTCCT CCCGCTTCCC TCTCCATGAA GTGATTCTGT TGACAATAAT 3004
GTATACTGCG CGTTCTCTTC ACTGGTTTAT CTGCAGAAAT TTCTCTGGGC TTTTTTCGGT 3064
GTTAGATTCA ACACTGCGCT AAAGCGGGGA TGTTCCATTG AATAAAAGAG CAGTGTGGTT 3124
TTCTGGGAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3184
AAAAAAAAAA AAAAAA 3200

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 623 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Thr Gly Ala Asn Ala Thr Pro Leu Asp Phe Pro Ser Lys Lys
1 5 10 15

Arg Lys Arg Ser Arg Trp Asn Gln Asp Thr Met Glu Gln Pro Thr Val
20 25 30

Ile Pro Gly Met Pro Thr Val Ile Pro Pro Gly Leu Thr Arg Glu Gln
35 40 45

Glu Arg Ala Tyr Ile Val Gln Leu Gln Ile Glu Asp Leu Thr Arg Lys
50 55 60

Leu Arg Thr Gly Asp Leu Gly Ile Pro Pro Asn Pro Glu Asp Arg Ser
65 70 75 80

Pro Ser Pro Glu Pro Ile Tyr Asn Ser Glu Gly Lys Arg Leu Asn Thr
85 90 95

Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu Glu Glu Arg His Asn Leu
100 105 110

Ile Thr Glu Met Val Ala Leu Asn Pro Asp Phe Lys Pro Pro Ala Asp
115 120 125

Tyr Lys Pro Pro Ala Thr Arg Val Ser Asp Lys Val Met Ile Pro Gln
130 135 140

Asp Glu Tyr Pro Glu Ile Asn Phe Val Gly Leu Leu Ile Gly Pro Arg
145 150 155 160

Gly Asn Thr Leu Lys Asn Ile Glu Lys Glu Cys Asn Ala Lys Ile Met
165 170 175

Ile Arg Gly Lys Gly Ser Val Lys Glu Gly Lys Val Gly Arg Lys Asp
180 185 190

Gly Gln Met Leu Pro Gly Glu Asp Glu Pro Leu His Ala Leu Val Thr
195 200 205

Ala Asn Thr Met Glu Asn Val Lys Lys Ala Val Glu Gln Ile Arg Asn
210 215 220

Ile Leu Lys Gln Gly Ile Glu Thr Pro Glu Asp Gln Asn Asp Leu Arg
225 230 235 240

Lys Met Gln Leu Arg Glu Leu Ala Arg Leu Asn Gly Thr Leu Arg Glu
245 250 255

Asp Asp Asn Arg Ile Leu Arg Pro Trp Gln Ser Ser Gly Thr Arg Ser
260 265 270

Ile Thr Asn Thr Thr Val Cys Thr Lys Cys Gly Gly Ala Gly His Ile
275 280 285

Ala Ser Asp Cys Lys Phe Gln Arg Pro Gly Asp Pro Gln Ser Ala Gln
290 295 300

Asp Lys Ala Arg Met Asp Lys Glu Tyr Leu Ser Leu Met Ala Glu Leu
305 310 315 320

Gly Glu Ala Pro Val Pro Ala Ser Val Gly Ser Thr Ser Gly Pro Ala
325 330 335

Thr Thr Pro Leu Ala Ser Ala Pro Arg Pro Ala Ala Pro Ala Asn Asn
340 345 350 
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Met Asn Ser Gly Pro Ser Glu Ser Trp Pro Tyr His Gly Met His Gly 
370 375 380 

Gly Gly Pro Gly Gly Pro Gly Gly Gly Pro His Ser Phe Pro His Pro 
385 390 395 400

Leu Pro Ser Leu Thr Gly Gly His Gly Gly His Pro Met Gln His Asn
405 410 415

Pro Asn Gly Pro Pro Pro Pro Trp Met Gln Pro Pro Pro Pro Pro Met
420 425 430

Asn Gln Gly Pro His Pro Pro Gly His His Gly Pro Pro Pro Met Asp
435 440 445

Gln Tyr Leu Gly Ser Thr Pro Val Gly Ser Gly Val Tyr Arg Leu His
450 455 460

Gln Gly Lys Gly Met Met Pro Pro Pro Pro Met Gly Met Met Pro Pro
465 470 475 480

Pro Pro Pro Pro Pro Ser Gly Gln Pro Pro Pro Pro Pro Ser Gly Pro
485 490 495

Leu Pro Pro Trp Gln Gln Gln Gln Gln Gln Pro Pro Pro Pro Pro Pro
500 505 510

Pro Ser Ser Ser Met Ala Ser Ser Thr Pro Leu Pro Trp Gln Gln Asn
515 520 525

Thr Thr Thr Thr Thr Thr Ser Ala Gly Thr Gly Ser Ile Pro Pro Trp
530 535 540

Gln Gln Gln Gln Ala Ala Ala Ala Ala Ser Pro Gly Ala Pro Gln Met
545 550 555 560

Gln Gly Asn Pro Thr Met Val Pro Leu Pro Pro Gly Val Gln Pro Pro
565 570 575

Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro Arg Ser Ile Glu Cys Leu
580 585 590

Leu Cys Leu Leu Ser Leu Leu Thr Gln Leu Pro Leu Pro Leu Pro Lys
595 600 605

Pro Gly Arg Gln Asp Pro Ser Pro Arg Arg Arg Trp Pro Glu Pro
610 615 620



1. A DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1, or a DNA essentially equal to the DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1.

2. A polypeptide comprising the full structure or a part of the polypeptide coded for by the DNA represented by SEQ ID NO:1.

3. A transformant transformed by a plasmid having, integrated therein, the full structure or a part of the DNA represented by SEQ ID NO:1 which can be expressed therein.

4. An antibody against the polypeptide as set forth in Claim 2 as an antigen.

5. A gene analysis method which comprises using, as a primer, a probe or a marker, a DNA comprising a part of the DNA represented by SEQ ID NO:1 and hybridizing the primer, the probe or the marker with a DNA to be tested.
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(57) A detailed genetic map on human chromosome 
11 was prepared. Then, a commonly deleted region on
the chromosome in the tumor tissues of patients with
multiple endocrine neoplasia type 1 was identified. Further,
by the linkage analysis on a family line with this disease,
a gene causative of this disease was localized. A
gene present in the region common to these observations
was cloned and the structure of this gene was
determined. Because a protein coded by this DNA is
homologous with those of transcriptional factors, it is
expected that the above-mentioned gene may be a
novel tumor suppressor gene. Further, it is also
expected that the above-mentioned gene and a protein
coded for thereby may be useful in preparations of a
remedy for cancer and a diagnostic drug for cancer.
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